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Dedicated to the memory of Philippe Flajolet 
Abstract 



We develop analytic tools for the asymptotics of general trie statistics, which are par- 
ticularly advantageous for clarifying the asymptotic variance. Many concrete examples 

^ are discussed for which new Fourier expansions are given. The tools are also useful for 

other splitting processes with an underlying binomial distribution. We specially highlight 

^ Philippe Flajolet's contribution in the analysis of these random structures. 

o 

CO 

1 Introduction 

> 

^ Coin-flipping is one of the simplest ways of resolving a conflict, deciding between two alter- 

natives, and generating random phenomena. It has been widely adopted in many daily-life 
situations and scientific disciplines. There exists even a term "flippism." The curiosity of 
understanding the randomness behind throwing coins or dices was one of the motivating ori- 
gins of early probability theory, culminating in the classical book "Ars Conjectandi" by Jacob 
Bernoulli, which was published exactly three hundred years ago in 1713 (many years after its 
completion; see [91, 98]). When flipped successively, one naturally encounters the binomial 
distribution, which is pervasive in many splitting processes and branching algorithms whose 
analysis was largely developed and clarified through Philippe Flajolet's works, notably in the 
early 1980s, an important period marking the upsurgence of the use of complex- analytic tools 
in the Analysis of Algorithms. 
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Figure 1: A tree rendering of the diverse themes pertinent to binomial splitting processes. 
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Technical content of this paper. This paper is a sequel to [54] and we will develop an ana- 
lytic approach that is especially useful for characterizing the asymptotics of the mean and the 
variance of additive statistics of random tries under the Bernoulli model; such statistics can 
often be computed recursively by 

X n — X In + X*^ In + T n , (1) 

with suitable initial conditions, where T n is known, X* n is an independent copy of X n and I n is 
the binomial distribution with mean pn, < p < 1. 

Many asymptotic approximations are known in the literature for the variance of X n , which 
has in many cases of interest the pattern 

V(X„) J P(wlogn), ifgfeQ 1 „, 

_ = c lo gn + c+ | 0i if g||^ Qj |+od), (2) 

where c may be zero, zu depends on the ratio and P(x) = P(x + 1) is a bounded periodic 
function. However, known expressions in the literature for the periodic function P are rare due 
to the complexity of the problem, and are often either less transparent, or less explicit, or too 
messy to be stated. In many situations they are given in the form of one periodic function minus 
the square of the other. Our approach, in contrast, provides not only a systematic derivation 
of the asymptotic approximation (2) but also a simpler, explicit, independent expression for P, 
notably in the symmetric case (p = q). Further refinement of the o(l)-term lies outside the 
scope of this paper and can be dealt with by the approach developed by Flajolet et al. in [34]. 



Binomial splitting processes. In general, the simple splitting idea behind the recursive ran- 
dom variable (1) (0 goes to the left and 1 goes to the right) has also been widely adopted in 
many different modeling processes, which, for simplicity, will be vaguely referred to as "bi- 
nomial splitting processes" (BSPs), where binomial distribution and some of its extensions are 
naturally involved in the analysis; see Figure 1 for concrete examples of BSPs that are related 
to our analysis here. For convenience of presentation, we roughly group these structures in four 
categories: Data Structures, Algorithms, Collision Resolution Protocols, and Random Models. 
To see the popularity of BSPs in different areas, we start from the recurrence (q — 1 — p) 

a n = TTn.fc (afc + ctn-fc) + &n, where 7T n ,k ■= ( T ^jP k q n ~ k , (3) 

which results, for example, from (1) by taking expectation. Here the "toll-function" b n may 
itself involve aj (J = 0, 1, . . . ) but with multipliers that are exponentially small. 

From an analytic point of view, the trie recurrence (3) translates for the Poisson generating 
function 

/(*)== c -*E£f* b > < 4 > 

n 

into the trie functional equation 

f(z) = f(pz) + f(qz)+g{z), (5) 
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with suitable initial conditions. Such a functional equation is a special case of the more general 
pattern 

where 6 = 0, 1, ... , and g itself may involve / but with exponentially small factors. When 
6 = 0, one has a pure functional equation, 

f(z) = af(pz + A) + 0f{qz + A) + g(z), (7) 

while when 6^1, one has a differential-functional equation. 

It turns out that the equation (6) covers almost all cases we collected (a few hundred of 
publications) in the analysis of BSPs the majority of which correspond to the case 6 = A = 0. 
The cases when 6 = and A > are thoroughly treated in [22, 80, 21, 56], and the cases when 
6^1 are discussed in detail in [54] (see also the references cited there). We focus on 6 = A = 
in this paper. Since the literature abounds with equation (5) or the corresponding recurrence (3), 
we contend ourselves with listing below some references that are either standard, representative 
or more closely connected to our study here. See also [17, 19, 41] for some non-random 
contexts where (5) appeared. 

Data Structures. Tries: [73, 77, 102]; PATRICIA tries: [61, 73, 101]; Quadtries and k-d tries: 
[32, 43]; Hashing: [20, 82, 39, 24]; Suffix trees: [60, 102]. 

Algorithms. Radix-exchange sort: [73]; Bucket selection and bucket sort: [76, 8]; Probabilistic 
counting schemes: [25, 30, 89, 27, 28]; Polynomial factorization: [39]; Exponential 
variate generation: [35]; Group testing: [47]; Random generation: [31, 94]. 

Collision resolution protocols. Tree algorithms in multiaccess channel: [3, 79, 83, 22, 80, 21, 
26, 103]; Initialization in radio networks: [85, 99]; Mutual exclusion in mobil networks: 
[81]; Broadcast communication model: [105, 49, 7]; Leader election: [23, 63, 92]; Tree 
algorithms in RFID systems: [53, 86]; 

Random models. Random graphs: [2, 46, 100]; Geometric IID RVs (or order statistics): [18, 
45, 74]; Cantor distributions: [10, 48]; Evolutionary trees: [1, 75]; Diffusion limited 
aggregates: [6, 78]; Generalized Eden model on trees: [13]. 

Asymptotics of most of the BSPs can nowadays be handled by standard analytic techniques, 
which we owe largely to Flajolet for initiating and laying down the major groundwork. We 
focus in this paper on analytic methods. Many elementary and probabilistic methods have also 
been proposed in the literature with success; see, for example, [14, 15, 62, 87, 102] for more 
information. 

Flajolet's works on BSPs. We begin with a brief summary of Flajolet's works in the analysis 
of BSPs. For more information, see the two chapter introductions on Digital Trees (by Clement 
and Ward) and on Communication Protocols (by Jacquet) in Philippe Flajolet's Collected Pa- 
pers, Volume III (edited by Szpankowski). 



4 



Flajolet published his first paper related to BSP in June 1982 in a paper jointly written with 
Dominique Sotteau entitled 1 "A recursive partitioning process of computer science" (see [38]). 
This first paper is indeed a review paper and starts with the sentence: 

We informally review some of the algebraic and analytic techniques involved in 
investigating the properties of a combinatorial process that appears in very diverse 
contexts in computer science including digital sorting and searching, dynamic 
hashing methods, communication protocols in local networks and some polyno- 
mial factorization algorithms. 

They first brought the attention of the generality of the same splitting principle in diverse con- 
texts in their Introduction, followed by a systematic development of generating functions under 
different models ([38, Sec. 2: Algebraic methods]). Then a general introduction was given of 
the saddle-point method and Mellin transform to the Analysis of Algorithms ([38, Sec. 3: An- 
alytic methods]). They concluded in the last section by giving applications of these techniques 
to one instance in each of the four areas mentioned above. 
Such a synergistic germination of diverse research ideas 



later expanded into a wide spectrum of applications and research networks (see Figure 2 for 
a plot of BSP-related themes). It was also fully developed and explored, and evolved into his 
theory of Analytic Combinatorics. Many of these objects become in his hands a subject of 
interest, and many follow-up papers continued and extended with much ease. 

Analysis of algorithms (and particularly BSPs) in the pre-Flajolet era relied mostly on more 
elementary approaches (including Tauberian theorems; see [41]), with some sporadic excep- 
tions in the use of the "Gamma-function method" (a particular case of Mellin transform): the 
height of random trees [11], the analysis of radix-exchange sort (essentially the external path 
length of random tries) given in [73, §5.2.2], PATRICIA tries and digital search trees [73, §6.3], 
odd-even merging [97], register function of random trees [66], analysis of carry propagation 
[72], and extendible hashing [20]. See Dumas's chapter introduction (Chapter 4, Volume III) 
for a more detailed account. 

Many asymptotic patterns such as (2), which most of us take for granted today, were far 
from being clear in the 1980's, notably in engineering contexts. For example, the minute pe- 
riodic fluctuations when log p/ log q is rational are often invisible in numerical calculations, 
leading possibly to wrong conclusions. Flajolet pioneered and developed systematic analytic 
tools to fully characterize such tiny perturbations, which he called "wobbles." See [29] for 
more information. 

Amazingly, most of the items in the big picture of Figure 2 were already discovered or 
clarified in the 1980's in Flajolet's published works with a few later themes aiming at finer 

'Note that the word "partitioning" is spelled as "partionning" in the title of [38], and as "partitionning" in the 
paper. 
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Figure 2: The diverse themes and methodology developed (or mentioned) in Flajolet's works 
that are connected to BSPs. 



improvements in results or more general stochastic models. Among these, the "digital process" 
and "probabilistic counting algorithms" became two of his favorite subjects of presentation, as 
can be seen from his webpage of lectures where about one third of talks are related to these two 
subjects. 

This paper is organized as follows. We briefly introduce tries, functional equations and the 
analytic tools in the next section. We then develop more analytic tools we need in Section 3, the 
most difficult part being the proof of admissibility under Hadamard product. Then we focus 
on the characterization of the asymptotic variance of general trie statistics in the following 
sections. We also include PATRICIA tries in Section 7 and conclude this paper with a few 
remarks. 
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2 Random tries, functional equations and asymptotic analy- 
sis 

The design of an ordinary dictionary according to the alphabetical (or lexicographical) order 
induces itself a tree structure, which is also the splitting procedure used in many digital tree 
structures and bucketing algorithms such as tries and radix sort. Tries (coined by Fredkin [40], 
which is a mixture of "tree" and "retrieval") were first introduced in computer algorithms by de 
la Briandais [12] in 1959, the same year when the radix-exchange sort (a digital realization of 
Quicksort) was proposed by Hildebrandt and Isbitz [51]; see [73, §6.3] for more information. 
Tries are one of the most widely adopted prototype data structures for words and strings, and 
admit a large number of extensions and variants. 

Given a set of n random binary strings (each being a sequence of Bernoulli random variables 
with parameter p), we can recursively define the random trie associated with this set as follows. 
If n = 0, then the trie is empty; if n — 1, then the trie is composed of a single (external) node 
holding the input- string; if n > 1, then the trie contains three parts: a root (internal) node used 
to direct keys to the left (when the first bit of the string is 0) or to the right (when the first 
bit of the string is 1), a left sub-trie of the root for keys whose first bits are and a right sub- 
trie for keys whose first bits are 1; strings directing to each of the two subtrees are constructed 
recursively as tries (but using subsequent bits successively). Thus tries are ordered, prefix trees. 
See Figure 3 for a trie of 7 keys. 

Asymptotic analysis of the trie recurrence (3) is nowadays not difficult and a typical way of 
deriving asymptotic estimates starts with the Poisson generating function (4), which satisfies 
the functional equation (5) (when a n does not grow faster than, say exponential), where g(z) 
there depends on b n and the initial conditions. From this, one sees that the Mellin transform of 
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/(*) 



/ 

Jo 



f(z)z s ~ l dz 



satisfies formally 




Then the asymptotics of a n can be manipulated by a two-stage analytic approach: first 
derive asymptotics of f(z) for large \z\ by the inverse Mellin integral 




where the integration path is some vertical line, and then apply the saddle-point method to 
Cauchy's integral formula (called analytic de-Poissonization [61]) 



This two-stage Mellin- saddle approach has been largely developed by Jacquet and Szpankowski 
(see [61]) and can in many real applications be encapsulated into one, called the Poisson- 
Mellin-Newton cycle in Flajolet's papers (see [33, 36]) 



which is formally obtained by substituting (8) into (9) and by interchanging the order of inte- 
gration. 

Note that such a formal representation may be meaningless due to the divergence of the 
integral. One of the most useful tools in justifying the exponential smallness of s] at 
c ± oo is Proposition 5 of Flajolet et al.'s survey paper [29] on Mellin transforms. For ease of 
reference, we call it the Exponential Smallness Lemma in this paper. 

Exponential Smallness Lemma. [29, Prop. 5] If inside the sector | arg(z)| ^ 6 
(9 > 0), f(z) = 0(\z\~ a ), as z ->■ 0, and f(z) = 0(\z\~P) as \z\ -> oo, then 
JZ[f- s ) = 0(e- e l 3 WI) holds uniformly for M(s) E (a, f3). 

This simple Lemma is crucial in the development of our approach. 

In various practical cases, the use of the Poisson-Mellin-Newton approach relies mostly on 
the so-called Rice's integral formula (or integral representation for finite differences) when the 
integral converges; see Figure 4 for a diagrammatic illustration. 

Asymptotics of either of the two integrals (8) and (10) rely heavily on the singularities of the 
integrand, which in turn depends on the location of the zeros of the equation 1 — p~ s — q~ s = 0. 
A detailed study of the zeros can be found in [21], and later in [17, 96]. While the dominant 
asymptotic terms are often easy to characterize when analytic properties of ^[g; s) are known 
(owing largely to the systematic tools Flajolet and his coauthors developed), error analysis 
turned out to be highly challenging when logp/ log q is irrational; see [34]. 

These analytic tools are well-suited for computing the asymptotics of the mean, but soon 
become very messy when adopted for higher moments, which satisfy the same type of re- 
currences but with convolution terms that are often difficult to manipulate analytically. The 




(r > 0). 



(9) 




(10) 
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Figure 4: The two analytic approaches to the asymptotics of a n . Here n n> k '■= {^)p k q a 



situation becomes even worse when dealing with the variance or higher central moments be- 
cause the high concentration of binomial distribution results in smaller variance, meaning more 
complicated cancelations in the desired asymptotic approximations have to be properly taken 
into account. 

The key, crucial step of our approach to the asymptotic variance of trie statistics is to intro- 
duce, as in [54], the corrected Poissonized variance of the form 

V{z):=Uz)-h{zf-zf[{z)\ (11) 

where fx and f 2 denote the Poisson generating functions of the first and the second moments, 
respectively. Such a consideration results in simpler Fourier series expansions for the periodic 
functions that appear in the asymptotic approximations due to the no-cancelation character, 
especially in the symmetric case. We will enhance this approach by introducing the class 
of JS-admissible functions as in our previous paper [54], a notion formulated from Jacquet 
and Szpankowski's works on analytic de-Poissonization (see [61]) and mostly inspired from 
Hayman's classical work [50] on saddle-point method (see also [37, §VIII.5]), via which many 
asymptotic approximations can be derived by checking only simple criteria of admissibility. 
The combined use leads to a very effective, systematic approach that can be easily adapted for 
diverse contexts where a similar type of analytic problems is encountered; see Sections 5 and 6 
for some examples. 

In general, polynomial growth rate for g(z) for large \z\ implies the same for / in a small 
sector containing the real axis. The only exception is the functional-differential equation [2] 

f(z) = f(qz)+g(z), 

for which the growth is of order z clogz when g grows polynomially for large \z\. Note that this 
equation is a special case of the so-called "pantograph equations"; see [2] for more information. 

Notations. Throughout this paper, q = 1 — p and < p < 1. Also h := —plogp — qlogq 
denotes the entropy of the Bernoulli distribution. The splitting distribution /„ is a binomial 
distribution with mean pn. For brevity, we introduce the generic symbol &[G)(x) to denote a 
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bounded periodic function of period 1 of the form 

&[G](x)={ * G z\{o} }, (12) 

where Xk — when |^|| = | with (r, £) = 1. The average value of is zero and the 
Fourier series is always absolutely convergent (it is indeed infinitely differentiable for all cases 
we study). 



3 JS-admissibility, Hadamard product and asymptotic trans- 
fer 

We collect and develop in this section some technical preliminaries, which are needed later for 
our asymptotic analysis. 

3.1 JS-admissible functions 

We begin with recalling the definition and a few fundamental properties from [54] of JS- 
admissibility (a framework combining ideas from [50, 61]). 

Definition 1. Let f(z) be an entire function. Then we say that f(z) is JS-admissible and write 
f G (or more precisely, f G g , ot, j3 G K) if for some < 6 < n/2 and \z\ ^ 1 the 
following two conditions hold. 

(I) (Polynomial growth inside a sector) Uniformly for | arg(z)| ^ 9, 

f(z) = 0{\z\ a (\og + \z\f), 

where log + x := log(l + x). 
(O) (Exponential bound) Uniformly for 6 ^ |arg(z)| ^ tt, 

/(*) := e z f(z) = O (e^l'l) , 

for some e > 0. 

The major reason of introducing JS-admissible functions is to provide a systematic analytic 
justification of the Poisson heuristic a n ~ f(n), where / is the Poisson generating function 
of a n . We do not however pursue optimum conditions here for simplify and easy applications. 
On the other hand, since the conditions of admissibility we impose are strong, we can indeed 
provide a very precise asymptotic characterization of a n . 

Proposition 3.1 ([54]). If f G r> tnen a n satisfies the asymptotic expansion 

a n = f —^-r j {n) + 0(n a - k \og^n), (13) 
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for k = 0, 1, ... , where the Tj 's are polynomials ofn of degree / 2 J given by 



converges as long as / is an entire function; see [54]. It is the asymptotic nature (13) that 
requires more regularity conditions, which can intuitively be seen by observing that x 
n~ j f(n) when / G J^; see [54]. 

The r/s are closely connected to Charlier and Laguerre polynomials; see [54] for a more 
detailed discussion. The first few terms are given as follows. 



T (n) 


n(n) 


T 2 (n) 




r 4 (n) 


75 (n) 




1 





—n 


2n 


3n(n - 2) 


— An(5n — 6) 


-5n(3n 2 - 26n + 24) 



The fact that 7~i = indicates that much information is condensed in the dominant term f(n). 

At the generating function level, the usefulness of JS-admissible functions lies in the closure 
properties under several elementary operations. 

Proposition 3.2 ([54]). Let m be a non-negative integer and a G (0, 1). 



(i) z m ,e- QZ e fS?. 

(ii) If f E jy,then Pf G J?Sf for any polynomial P(z). 
(Hi) Iff G J?y, then f(az) G 

(iv) Iff, 9 e then f + g G 

(v) Iff, g G JST, then f(az)g((l - a)z) G 

(vi) IffE jy, then f^ G 



We will enhance these closure properties by proving that JS-admissibility is also closed 
under Hadamard product. 

3.2 Asymptotic transfer 

For our purposes, we need also a transfer theorem for entire functions satisfying the functional 
equation (5). 

Proposition 3.3. Let f(z) and g(z) be entire functions satisfying 





Note that the Poisson-Charlier expansion 




f(z) = f(pz) + f(qz)+g(z) 



with /(0) given. Then 



f e if and only if g G J^. 
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Proof. The proof is similar to and simpler than that of Proposition 2.4 in [54]. Thus we only 
give the proof for (I). Define 

B(r) := max \f(z)\. 

I arg(z)|^e 

Then 

B(r) ^ B{pr) + B{qr) + O (r Q (log + rf + l) . 
Now define a majorant function K{r) by 

K{r) = K(pr) + K(qr) + C (r a (\og + rf + l) , 

where C > 0. Then B(r) ^ K{r) for a sufficiently large C > 0, and by standard Mellin 
argument [29] or by the proof used in [95] 

( 0(r), if a < 1; 

K(r) = < olr a (log + r f +1 ), if a = 1; 

{ 0(r a (log + r) /3 ), if a > 1. 

This completes the proof. I 

We now refine the asymptotic transfer and focus on asymptotically linear functions. 

Proposition 3.4. Let f and g be entire functions related to each other by the functional equation 
f(z) = f(pz) + f(qz) + g{z) with /(0) given. Assume < 9 < tt/2, a < 1 and (3 G R. 

(a) Ifg(z) = 0{\z\ a (\og + \z\Y), where the O -term holds uniformly for \z\ ^ 1 and | axg(z)\ ^ 
0, a* | z\ — > oo in the same sector, 

^ = + ^[G](r log 1/p z) + o(l), 

where the notations JF[G] (x) and r are defined in (12). 

(b) If g{z) = cz + 0(\z\ a ()og + \z\Y) uniformly for \z\ ^ 1 and |arg(z)| ^ 9, then, as 
\z\ — > oo in the same sector, 

M=c lQgz + ho + j?[ G ]( r iog 1/p z) + o(l), 
z n Iy 



where 



c c(plog 2 p + glog 2 g) 

°'~ ~h Zh? ' (14) 



G{s) is the meromorphic continuation of^[g; s], and 



c := lim G{s) H — 

s->-l \ S + 1 
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Proof. Without loss of generality, we may assume that /(0) = /'(0) = g(0) = g'(0) = 0. Then 
both Mellin transforms exist in the strip (—2, —1} and 

j*\h s] G{s) 



1 — p s — q 



Note that G(s) can be extended to a meromorphic function in the strip (—2, —a — e). In the 
case of (a), G(s) is analytic on the line §ft(s) = — 1 while in the case of (b) G(s) has a unique 

simple pole on 3?(s) = — 1 at s = —1 with the local expansion G(s) = — c/ (s + 1) + Co H . 

Note that by applying the Exponential Smallness Lemma ([29, Prop. 5]), we have the estimate 

\G(a + it)\ = 0(e-W), 

uniformly for large \t\ and a E (—2, —a — e). Thus the Proposition follows from standard 
Mellin analysis (see [29]) and known properties of the zeros of 1 — p~ s — q~ s (see [21]). I 

In the symmetric case when p = q = 1/2, both error terms o(l) in the Proposition can be 
improved to 0(max{l, |2| Q_1 (log |^|)^)}. Indeed, all error terms in such a case in this paper 
can be improved by standard argument; we focus instead on the Fourier series expansion in this 
paper. 



3.3 A Hadamard product for Poisson generating functions 

We need a new closure property for the analysis of the variance. Given two Poisson generating 
functions 

z — 4 n\ A — 4 n\ 
we define the Hadamard product of these two functions as 

~h(z) :=f(z)Q~g(z) = e- z J2^¥ zn - 

This definition differs from but coincides with the usual one if we consider the corresponding 
exponential generating functions. We show that JS-admissibility is closed under the Hadamard 
product. The proof is subtle and delicate. 

Proposition 3.5. /// e JS^^ and g G JS^^, then h e J5? a 

1+ce2t p 1+ /3 2 - More precisely, 

we have 

h(z) = f(z)g(z) + zf{z)g'{z) + O (|^| Ql+a2 - 2 (log + \z\f^) , (15) 
uniformly as \z\ —> oo and \ arg(z)| ^ 9, where < 6 < it/ 2. 

Our proof indeed gives an asymptotic expansion for h; we content ourselves with the state- 
ment of (15), which is sufficient for our purposes. 

Proof. Let f(z) := e z f(z),g(z) := e z g(z) and h(z) := e z h(z). Also let < 9 < tt/2 be 
an angle where (I) holds for both f(z) and g(z). Note that conditions (I) and (O) remain true 
if #o is replaced by an arbitrarily small but fixed angle < 9 ^ 9 with a suitable choice of 

e = e(9). 
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We prove the proposition in the special case when /3\ = (3 2 = since the proof in the 
general case remains the same with only additional logarithmic terms in the corresponding 
error estimates. 

Define 

2 

Substituting here z — > zu, multiplying both sides by ue~ u , integrating from to infinity and 
multiplying the result by e~ z2 /2, we obtain 

We now fix a < 9 < 9 . We first show that h(z) satisfies condition (O) for z lying outside the 
sector | arg(,z)| ^ 6. Assume 9/2 ^ \y\ ^ ir/2. Then 

J(re iy ) = — [ /(re^ +?/) Wre iM ) dt 

Note that (I) and (O) imply that 

f(z)=0{(\zr + l)e^), and g(z) = O ((\z\ a > + l)e W ) , (16) 

uniformly for z G C. Now making the change of variables t h-> t — y and taking into account 
that the function under the integral sign is periodic, we see that 

j(re iy ) = —[ /(re%(re i(22/ ^)dt 



2ti 



— 7T 

i r y 



2tt 



/(re^^re^ 2 ^) dt + O ((r a2 + l)e (2 " e)r ) 



y 

it\ 



Here we evaluated the integral over the region \y\ ^ \t\ ^ 7r by the estimate \f(re )| = 
0(e^ 1_E ^ r ), which follows from (O), and used the upper bound of (16) for \g(re l ^ 2y ^)\. In a 
similar way, we note that < 9/2 ^ \2y — t\ ^ 37r/2 whenever |t| ^ |y| and # ^ |y| ^ n/2. 
This means that 2 = re^ 2y ~ t ' ) lies inside the sector | arg(z)| ^9/2 and as a consequence we 
can use estimates \g(re l( - 2y ~ tS) )\ = 0(e^ 1 ~ e ^ r ) and \f(re lt )\ = 0((r ai + l)e r ) to evaluate the 
integral over the range \t\ ^ \y\. Combining these estimates, we get 

J(re iy ) = O ((r ai + r° 2 + l)e (2 " £)r ) 
= O (e 2 ^) , 

where e x is chosen such that (2 — e)/2 < si < 1. This estimate yields 



ue 2e ir u e -u du \ =0 re ^ 



e r2e2w h(r 2 e 2ty ) = O Qf 

which implies, by replacing r — > y/r and y — > y/2, the estimate 

e reW h{re iy ) = O (re^ 1 
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in the region 9 ^ \y\ ^ n for any fixed 9 > 0. Thus condition (O) holds. 

We now prove that h(z) grows polynomially in the sector | arg z\ ^ 9 with some sufficiently 
small 9 > 0. 

Note that | arg(ze ±lt )\ ^ 9$ /A for all values of t and z such that |argz| ^ 0o/A and 
n^\t\^ 0„/2, which, by (I) and (O), implies that /(«e ft ) = 0(e £2 ^) and #(W*) = 0(e £2 l 2 l) 
with a suitable choice of e 2 < 1 for all z and i satisfying such restrictions. It follows that 



1 r e o/ 2 

J{z) = — / e^^fize^gize-*) dt + O (e 26 ^) 
27r J-e /2 



when | argz| ^ 9 /A. Thus 

„-2 2 roo rdo/2 _ 

/^ 2 ) = - — / ue ~ u2 / e^^fizuj^gizue-*) dtdu + 0( \ z \ e 4\A 2 -^ 2 ) ) . ( 17 ) 

Noting that $l(z 2 ) = cos(2 aig(z))\z\ 2 ^ (1 - arg(z) 2 /2)|,2| 2 , we then have 

/^ 2 ) = + O (l^le-C 1 -^-" s(-) 2 /2))kl 2 j ; (lg) 
where J(z) denotes the double integral 



An 



oo 



'0 i|t|sS6» /2 



Since | arg(z)| ^ 9 /A, the arguments zue lt and zue~' lt of the functions / and g lie inside the 
sector | arg(^)| ^ 9 , which means that f(zue lt ) = 0(\z\ ai ) and g(zue~ %t ) = 0(\z\ a2 ). 

Changing the order of integration and making a change of integration path from the interval 
u E (0, oo) to the line (0, zoo) by mapping u4 uz cost, we get 



J2 rdo/2 pco 

I(z) = — / / e' z2{u - l)2cos2t - z2sln2t f(z 2 ue lt cost)~g(z 2 ue- it cost)uducos 2 tdt 
4tt J-0 o / 2 Jq 



f>0o/2 poo 

O ( \z\ 2 I / e- R ^)(«- 1 ) 3 ^ a ^/ 2 )- R ^)^ at (|z 2 |u + l) ai (|z 2 |u + l) 0,a ud«dt 

6»o /2 JO 
/■oo 

0(k| 2 ( / e-^ u -^ cos2 ^ 2 \\z\ 2 u + ir +a2 udu) I e-^ sin2 'dt 
Vvo / J\t\^e /2 

(\ z \ _|_ ]_)2+2ai+2a2 



= 0(|z| 2ai+2a2 ), 

uniformly for large \z\ in the sector | arg(z)| ^ 9o/A. 

Applying this estimate to the expression (17) of h(z 2 ), we obtain 

h(z 2 ) = O (\z\ 2ai+2a2 ) . 

Thus 

h{z) = O (\z\ ai+a2 ) . 
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for all | argz\ ^ 9 ^ 9 /2, where 9 is chosen to be small enough to ensure that the error term 
in (18) decreases exponentially fast. This proves the first part of the proposition. 
To prove the finer estimate (15), we use the Taylor expansion 

f{z)= ^^(^-^>0(m a x{(|z| + l) ai - w ,(| W | + l) Ql - w }|^-^| w )^ 

0<j<N 3- 

for any fixed N ^ 1. Applying this formula with z — > z 2 ue %t cost and w — > z 2 , we get 

f(z 2 ue u cost) = fb) -P z2j ( ueit cost ~ l ) j 

+ O {{\z\ 2 (u + 1) + l) Q >e i4 cost - 11") , 
and a similar expression for g. Substituting these expressions with N = 4 into I(z), we get 

~2 [6 /2 poo 

I(z) = — / / e-^^^'^^-^^V^Vcos^^^Me-^cos^udwcos^dt 
4?r J-e () /2Jo 

e /2 foo 

-z 2 {u-\) 2 cos 2 t-z 2 sin 2 tt qi At _ . _ iU/„.-il „ nQ ^ _ i V,, J,. „ nQ 2 



ue cost - 1) (we cost - 1) wdwcos tdt 
e /2Jo 

/ \ 

O 



0<fc,Z<4 

fc+;>3 



/ 



(19) 



where 



6» /2 /-oo 

2 2 («-l) 2 cos 2 t-2 2 sin 2 t i, ,/i |2/ , , ,x , -i\p\ 0l Jt t 



I PtK := / / e- z ^- 1)cost - zsint udu(\z\ 2 (u + l) + l) p \ue u cost-l\ K dt. 

Applying now the inequality 

\ue lt cost — 1| = |ucost — e~ lt \ = |ucost — cost + zsint| ^ \u— 1| + |t|, 



we get 



' r e /2 r oo \ 

/ / e -X(z 2 )(u-l) 2 cos 2 t-X(z 2 ) sin 2 * u (| z |2( u + 1) + l)P(|„ _ l|" + | t |«) du dt . 

. J-80/2J0 J 



Note that 3ft(z 2 ) ^ |z| 2 cos26 l and sin 2 t ^ cit 2 when | arg(z)| < 9 and |t| < 9 /2 for some 
constant c\ > 0. Thus there exists a positive constant c > such that 

3tf(^ 2 )(u- l) 2 cos 2 t + K(^ 2 )sin 2 t ^ c(u - l) 2 \z\ 2 + ct 2 , 
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for | arg(z)| < and \t\ < 6 /2. It then follows that 

/ p6 /2 poo \ 

IpK = [ / e-^-^-^udzWu + 1) + iy(\u - 1\ K + \t\ K ) dudt 

w-eo/2 y 

= 0\ ( r e- c{u - 1)2 ^ (\z\ 2 (u + l) + l) p \u - l\ K udu) f°' 2 e- c|z|2 * 2 dt 

/ poo \ Mj/2 N 

+ (/ e- c(n - 1)2|z|2 (|z| 2 (u + l) + l)^du / e- c|2|2 ' 2 |t| K dt 
Vvo / J-e /2 

= o ((\z\ 2 + iy\z\- K - 2 ) . 

Substituting this bound in the error term of (19), we obtain 

I{z)= £ /W( ^f ( ^ ) ^ + Q(kl 2ai+2Q2 - 4 ) i 

where 

z 2(l+fc+0 

^ = ~^r~ 

/8 /2 poo 
/ e -, 2 («-l) 2 cos* t-z* sinH^t cog t _ ^fc^-it cos t _ du CQS 2 t dt 

6*0/2^0 

We can approximate the integral Sk,i by reversing the order of the procedure by which we 
obtained it. First, making the change of variables u i-> u/(z cost), we get 

i p0 o /2 poo 

S kl = — / e- {u - zcost)2 - z2sin2t (zue u - z^izue-* - z 2 ) l ududt 

47r J-80/2J0 



p—z 2 p n p°° 

— J J e- 2zucost (zu^ - z 2 ) k {zue~ a - z 2 ) l ue' u2 du dt 

+ O ({1 + \ z \k+ l ) e M 2 (z)™s 2 (6o/2)-3t(z*)\ 

POO / 1*77 \ 

J (J [e ZU£lt (zue lt - z 2 ) k ] [e zue ~ U (zue- u - z 2 ) 1 ] dt) ue~ u2 du 
O [{I + \ z \k+i} e MH*)™s 2 (e /2)-3i(z 2 )^ 

. z 2 ^ U n,k^n,l z 2n + Q ^ + |^|*+i) e » 2 W cos 2 (0 o /2)-5R( Z 2 ) ^ ^ 



47T 



n>0 

where the z/„ fc's are defined by 



In particular, 

(n — z 2 ) 2 



S hl = e- zi [z' + Y. J z 2n )+0 ((1 + \z\ 2 )e^ cos2 ^-^) 
= z 2 + o(jl + | 2 | 2 )e R2(2)cos2(e ° /2) - R(z2) ) . 
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Similarly, 

5 ,o = l + o(e R2 W cos2 ^/ 2 )-^ 2 )), 

and 

whenever m ^ n. Therefore 

h(z 2 ) = I(z) + O ((1 + \ z \^+^ e ^{ z ) cos ^e /2y^)\ 
= f(z 2 )g{z 2 ) + z 2 f'(z 2 )g'(z 2 ) + O (\ z \^+^-^ _ 

Accordingly, 

h( z ) = I{Jz) + O ((1 + |^i+" 2 ) e » 2 Wc°sW2)-^ 2 )j 
= f(z)g(z) + z]\z)~g\z) + O (|^|«^^ ~ 2 ) . 
This proves the second part and completes the proof of the proposition. I 

4 Asymptotic variance of trie statistics 

We address in this section the asymptotic variance of general trie statistics. 

Let X n be an additive shape parameter in a random trie of size n. Then X n satisfies the 
following distributional recurrence 

X n ±X In + X*_ In + T n (n>2), (20) 

where I n = Binom(n,p), X n — X*, and X n ,X*, I n ,T n are independent. Without loss of 
generality, we may assume that X = and X\ — (only minor modifications needed when 
under nonzero initial conditions). Changing the value of X and X\ affects only the mean but 
not the variance. 

Consider first the moment-generating functions M n {y) := E,(e x " y ). Then, by (20), 

M n {y) = E(e T " y ) £ 7c n , k M k {y)M n _ k (y) (n ^ 2), 

Q</e<n 

where 7r n fc := (£)p k q n ~ k . By taking derivatives, we obtain the recurrences for the first two 
moments (fi n := E(X n ) and s n := E(X 2 )) 

= TTn.fc (jUfc + V>n-k) + E(T n ) 

s„ = ^ vr nifc (s h + s n _ k ) + E(T 2 ) pi) 
+ 2 7r„ ifc (nk^n-k + E(T n ) (/i fe + /i n -fc)) • 
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Our major interest lies in the variance o\ := V(X n ), which also satisfies the same type of 
recurrence 

where A n>fc := fj, k + /j, n _ k - fi n + E(T n ). 

Different approaches have been proposed to the asymptotics of o\\ these include an ele- 
mentary induction approach (see [7, 52]), the second-moment approach (see [67, 69, 70, 71]), 
Poissonized variance (by considering fi{z) — fi(z)) approach (see [59, 60, 76, 93, 77, 90]), 
(bivariate) characteristic function approach ([57, 77, 60]), and Schachinger's differencing ap- 
proach [95]. 

The slight modification of our approach, which relies on (11), from the usual Poissonized 
variance one turns out to be very helpful and makes a significant difference, notably in the re- 
sulting expressions for the periodic functions, mostly because the cancelation is avoided (some- 
how incorporated in the generating functions). 



4.1 Analytic schemes for the mean and the variance 

The tools we developed in Section 3 are useful in establishing simple, general, analytic frame- 
works under which asymptotics of the mean and the variance can be easily derived by checking 
only a few sufficient conditions. 

Asymptotics of the mean. Denote by fi(z) and gi(z) the Poisson generating function of 
Epf n ) and E(T n ), respectively. Then 

fi(z) = f 1 (pz) + f 1 {qz) + g 1 {z) 1 

with/ 1 (0) = /i(0) = 0. 

Theorem 4.1. Let < 6 < tt/2, a < 1 and (3 6 R. If either g"\ e J,5^ a or gi(z) G J,5^ , and 
gi(z) = cz + O (|z| a (log + Iz]) 13 ) uniformly as \z\ — > oo and \ arg(z)| ^ 9, where c e R, then 

E(Xn) - £ logn + d + &\G x \(r log 1/p n) + o(l), 



n 



where d = G\{—l)/h if c = and d = ho (see (14)) if c ^ 0, G\(s) := <M\g\\ s] and the other 
notations are described as in Proposition 3.4. 



Proof. Combining Propositions 3.1 and 3.3, we have 

FCi) 
3 



E(x n )= ^r^W + oM 

0s£j'<2fc 

for k = 0, 1, Then apply Proposition 3.4. I 

If the initial conditions are not zero, say X = a and Xi = b, then we consider fi(z) 
fi(z) — (b — a)z — a, leading to the functional equation 

h{z) = h(pz) + Uqz) + ~ gi (z) + a(l - (1 + z)e~ z ), 
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which results in an additional linear term for E(X n ) of the form 

(b - a + °- + a^[G )(c log 1/p n)J n, 

where Gq(s) := — (s + l)T(s). Such an additional term is related to the expected size of tries; 
see Section 5.1. 

Functional equations related to the variance. For the variance, we begin with the second 
moment. Let f 2 {z) and g 2 (z) be the Poisson generating function of E(X^) and E(T^), respec- 
tively. Then, by (21), 

f 2 (z) = f 2 (pz) + f 2 {qz) + 2f x (j>z)h(qz) + + 

where 

h 2 (z) = 2e~ z ^E(T n ) ^ n n:k (jj, k + fjL n - k ) — (22) 

= 2e~* J2 E ( T n)»n^ - 2e- z ^(E(T n )) 2 — , 
z — ' nl z — ' n! 

the last two terms being Hadamard products. 
Now, let 

V x {z) := h{z) - ~h{zf - zfi(z) 2 
V T (z) :=g 2 (z)-~g 1 (z) 2 -zg' 1 (z) 2 . 

Then by a straightforward computation 

V x (z) = V x (pz) + V x (qz) + V T (z) + fa(z) + fa(z), (23) 

where 

Mz) := h 2 (z) - 2~ 9l {z) (h(pz) + Uqz)) - 2z~g[(z) (pf[(pz) + qf[(qz)) 

, ; , 2 (24) 

02 0) := pgz (f[(pz) - f[{qz) 

Ideas of our approach. We sketch here the underlying ideas used in our approach before 
presenting a simple analytic scheme for the asymptotics of the variance. We assume first that 
gi E JPS^ • This implies the JS-admissibility of /i, and thus, by Proposition 3.1, we have the 
asymptotic expansion for the mean 



/'»= E ^Y^^(")+o(/i(n)n- fc ) (* = 1,2,... 



If we also assume g 2 € ^A^* tnen we nave tne same type of expansion for E(X 2 ) with fi there 
replaced by f 2 . Thus (dropping error terms for convenience of presentation) 

Q<j<2fc J ' \0^j<2k J ' 
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Now substituting / 2 = Vx + f\ + z{f[) 2 yields formally 

^~^(n)-^(n)-y^(n) 2 + ---, 

under suitable growth conditions and a suitably chosen k. Thus the asymptotics of the variance 
is reduced to that of V(n) and its derivatives. Further extensions of this approach are discussed 
in detail elsewhere. 



Asymptotics of the variance. We now show that the variance of X n can also be handled 
in a general way by reducing the required asymptotics to essentially checking conditions for 
JS-admissibility. 

Theorem 4.2. Let < 9 < n/2,a < 1 and (3 E R. Assume g 2 E J?Sf and V T {z) = 
O (|z| Q (log + \z\)P) as \z\ — > oo in the sector | axg(z)| ^ 9. 

(a) Ifp = q = 1/2, andgi E or~g x E J^q, then 

n log 2 ' 

w/zere Xfc = §3 ««d = JZ[V T + s]. 

(b) Assume p ^ q. 

(i) if h e 

^ = ^ + ^[G](rlog 1/p n) + o(l). 

Here G(s) = $i(s) + $ 2 (s), w/zere $i(s) = ^£[V T + 4>i;s] and $ 2 (s) w r/i<? 
analytic continuation of ^[4>2] s}. 

(ii) Ifgi E J?,5f andg\ = z + 0(\z\ a (log + |z|)^) uniformly as \z\ — >■ oo and | arg(z)| ^ 

V(X„) pq\og 2 (p/q) d pq log 2 (p/q)(p log 2 p + qlog 2 q) 
— = P log-+^ + W4 

+ ^[G](rlog 1/p n) + o(l), 

Here G(s) = $i(s) + $ 2 (s), w^ere $i(s) = ^#[Vr + and $ 2 (s) w f/ie 

meromorphic continuation o/^#[0 2 ;s], anJ d = $i(— 1) + lim s ^_ 1 ($ 2 ( s ) + 
pglog 2 (p/g)/(/i 2 (s + l))). 

Proof. Since is assumed to be small (less than linear), we first show that, under the assump- 
tions of the theorem, both <pi and 2 are also small; see (23). 

If g\ E c/^a, then f x E ^^5% by Theorem 4.1. These imply, by Proposition 3.5, that 
h 2 E and 

h 2 (z) = 2~ gi {z)h{z) + 2zg[(z)f 1 (z) - 2^) 2 - 2z~g[(z) 2 + O (M*" 1 ^ \z\f) 
= 2~ gi {z) (Upz) + Uqz)) + 2zg[{z) (pf[(pz) + qf[(qz) 



+ o(\z\ a - 1 (log + |z|) /3 ) 
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uniformly as \z\ — > oo and | arg(z)| ^ 9. It follows, by (24), that 0i (z) = O (\z\ a 1 (log + \z\)^ 

Similarly, if g\ G J'S^ , then h 2 G JtS? and 4>x{z) = O (log + \z\) as |^| — > oo in the sector 
| arg(z)| ^ 9. 

Without loss of generality, we may assume that all generating functions / involved here 
have the property that /(0) = /'(0) = 0. 

Consider first V T + <p 1 . By assumption and by the preceding analysis, we see that ^[Vp + 
(pi ; s] exists in the strip (—2, —a — e), with e > arbitrarily small. Thus we argue as in 
Proposition 3.4 and obtain (25). Note that 9 > is crucial here. 

We now turn to <p 2 , which is zero when p = q. So assume now p ^ q. If g\ E ^^ a/3 , 
then its Mellin transform G\ exists in the strip (—2, —a — e) and, by applying the Exponential 
Smallness Lemma, is exponentially small at c ± ioo. Thus from the integral representation 



2m y(_x_ e ) 1 - p~ w - q~ v 



it follows that f[(pz) — f[(qz) = o(l) and consequently by (24) <p 2 (z) = o(\z\) as \z\ — >• oo 
and | arg(z)| ^ Thus the Mellin transform $2(5) of ^2(2) exists in the strip (—3, —1) and 



Note that 



2?rz y ( _ 1/2) (1 - p 1 -" 1 - - p w - s - g"- s ) (26) 

x (io — l)Gi(w; — — — w) dw. 

p -w_ g -w _(l_p-w) + (l_ g -«) 



If log p/ log q = r/£ e Q, where (r, £) = 1 are positive integers, then any zero of the form 
2rk%i/ logp of the denominator is also a zero of the numerator. Thus the integration path can 
be moved to the imaginary axis. 

By summing over all residues of poles with real parts less than —a (see [21] for a detailed 
study), we see that $2(5) can be extended to a meromorphic function beyond the line 3ft(s) = 
— 1 which is analytic on 3ft(s) = —1. Consequently, the asymptotic estimate in case (b)-(i) 
follows as in Proposition 3.4. 

The analysis for the last part (ii) is similar with the only difference that now 2 (z) = 
pq log 2 (p/q)z/h 2 + o(\z\) uniformly as \z\ — > 00 and |arg(z)| ^ 9. Hence, one can again 
extend M\<\>2\ s] to a meromorphic function beyond the line 3ft (s) = —1, but there is a simple 
pole on 3ft (s) = — 1 at s = — 1 with the singular expansion ^#[02 5 s] = —pq log 2 (p/q)/(h 2 (s + 

1)) + d-\ . Thus similar arguments used in Proposition 3.4 apply. This completes the proof. 

I 



Calculation of the Fourier coefficients <& 2 (— 1+Xfc)' We outline here an approach by residue 
calculus to simplify the Fourier coefficients $2(— 1 + Xk), which will be applied several times 
later. 

We begin with the integral representation (26), which we first shift to the imaginary axis. 
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Then, we use the following decomposition 

* , , ^ 1 f f 1 p 1+w - q 1+w 



x (w - l)Gi(tw - 1)(-1 + Xk - w)Gi(-l + Xk~w) dw 

=: Jo + J, 

where the integration contour f, Q . + is the imaginary axis but with a sufficiently small indenta- 
tion to the right of each zero of the equation 1 — p l ~ lt — q x ~ %t = for real t (only one when 
j^H 7^ Q, and an infinity number of equally- spaced ones otherwise). 

By the change of variables w h->- Xk — w and then by moving the line of integration to the 
right, we have 

~ ~ !)(-! + Xk ~ w)G 1 (-l + Xk ~ w) ^ 

° 27U y (0) _ 1 -pl-Ho _ 



1 /" (w - l)Gi(^ ~ !)(-! + Xfr - w)Gi(-l + Xfc ~ w) ^ 
2tu 7 (0)+ 1 - - ™' 

where the integration contour = — /( )+- The last integral equals 

— / (w- l)Gi(w - 1)(-1 + Xfc - w)Gi(-l + Xfc - w) dw + J. 
2m J {0) + 

Note that 

roc 

(w - \)G x {w - 1)(-1 + Xfc - «;)G?i(-l + Xfc - w) dw = / &(*) 2 r 1+ ** dt. 



1 

2vri ^ (0)H 

Combining these relations, we obtain 



$ 2 (-l + Xfc) = 2J - \ J2(Xi ~ l ) G ^i ~ !)(-! + Xfc-i)Gi(-l + X fc 



poo 

+ m 2 t- l+xk dt. 

Jo 

To further simplify the integral J, we write 



h 

'-~ (27) 



j>2 • 7 ' 



Then it follows from the Direct Mapping Theorem (see [29]) of Mellin transform that Gi(s) 
can be extended to a meromorphic function to the left of 9R(s) = —2 with simple poles at 
s = — j, the residue there being equal to bj/j\. 
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If we assume that — (s — l)C?i(s — 1) has no singularity to the right of imaginary axis, then 
we obtain 



This and (27) will be useful later. 

This procedure is very effective in many applications having linear variance (namely, the 
situation of Theorem 4.2 (b)-(i)); similar but slightly more involved arguments can be used in 
more general situations such as n log n- variance. 



We apply or slightly modify the schemes developed in the previous sections to a few stan- 
dard examples in the literature for which new results are proposed for the asymptotics of the 
variance. 

5.1 Size of random tries 

The size of a trie is defined to be the number of internal nodes used, which becomes a random 
variable when the input sequence is random. For example, eight internal nodes are used in 
the trie in Figure 3. Under our Bernoulli model, we see that the size X n satisfies (1) with 
X = X\ = and T n = 1 for n ^ 2, where n denotes the total number of input keys 
(or external nodes). Under different guises and different initial conditions, this is the most 
studied random variable defined on tries or related structures in the literature, most of them 
dealing with the expected value and very few of them with the variance. See, for example, 
[5, 9, 15, 62, 79, 85, 99] and the references therein for the mean, and [59, 64, 65, 67, 68, 77, 93] 
for the variance. 

Since T n — 1, we have 



V T {z) := ~ gl{z) - zg[(z) 2 = e~ z (l + z-(l + 2z + z 2 + z 3 )e~ z ), (29) 




(28) 



5 Applications 



g^z) = g 2 (z) = 1 - (1 + z)e~ z . 



From Proposition 3.2, we see that both functions gi, g 2 G ^0^ o . Also we have 



and (see (24)) 



fa(z) = 2e- z ((1 + z) (hipz) + Uqz)) - z 2 (pf[(pz) + qft{qz)j) . 



(30) 



Both functions are exponentially small for large \z\ with $t(z) > 0. 

The application of both Theorems 4.1 and 4.2 is straightforward. Since 



Gi(s) = ...#[<?i ;s] 



(s + l)T(s), 



we thus obtain, when X { 



o — 



X 



i — 



0, 



E{X n ) 1 



+ «^[Gi](rlog 1/p n) + o(l) 



n h 



24 



a well-known result. When X = a and X 1 = b, then a direct modification of the same 
argument gives 

= b _ a + ( a + i) Q + j?[ Gl ]( r log 1/p n)^ + o(l). 

As regards the variance, the functions involved become more complicated. We state our 
results by distinguishing between symmetric case p — 1/2 and asymmetric case p ^ q. 

Theorem 5.1 (Symmetric case: p = 1/2). The variance of the size of random symmetric tries 
satisfies asymptotically (Xk '■= Zkni/ log 2 J 



V(X n ) 



1 J2G(-l + X k)n^+o(l), 



n log 2 

where the mean value of the periodic function is given by 

^— *i = — *- | - + 2 V tzlKj ~ | w 0.845858623076001 ■ ■ ■ , (31) 
log2 log2 ^4 23 - i J yj > 



and for k ^ 

G( " 1 + Xfe) = 4 + 2 Z. U + 1)1(2,-1) • ( 32 ) 



i>l 

The numerical value in (31) coincides with that given in [93], where they derived the alter- 
native expression 

_J 1 ( z iy_^ y _^. 

21og2 l g 2 2 log2^2i-l log 3 2 jf y sinh g ' 

see also [68]. This expression can also be derived by the simplification procedure for deriving 
(27) (see also Theorem 5.2 below). Equating the above two expressions yields the identity 

which can be proved directly by the residue calculus similar to that used in deriving (27). Of 
special mention here is that the series on the right-hand side is less than 1.1 x 10~ 10 , meaning 
that the first two terms on the right-hand side already provide a very accurate approximation to 
the series on the left-hand side. A third expression with the same numerical value is given in 
[77, Sec. 5.4] 

3 



1 ( 1 2 V 1 ^ 1 47r2 V 

^2*+T/~ lo72 ~ log^ |^nh |g 
which can be obtained from (33) by the identity 

v i _ v i-iy- 1 

2^ 23 + 1 ^ 1? - 1 ' 

3>l j>l 
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Regarding the oscillating terms, Kirschenhofer and Prodinger derived in [68] (with terms slightly 
simplified and with minor corrections) 



G{-\ + X k) = -3x*r(-i + Xk ) - (i - xk)(2 - Xk)T( X k) \ 



(Xk+j){^) 



Xfcr(i + Xk) 

log 2 



2T(1 + X k) 



-(i + l)(2i-l) 
5-Xk ^(Xk+j + lK-f^ 1 ) 



4(1 -Xk) j£ (j + l)(2i-l) 



+ i^2 E x,r(-i + xi)xmr(i + 



Xr, 



j+m=k 



which is to be compared with our expression (32). Numerically, the amplitude of the oscillating 
part is bounded above by J^^^g |G(—1 + Xfc) I /log 2 ^ 1.7 x 10 -6 ; see Figure 5. 





Figure 5: Periodic oscillations of the variance when p = 1/2: Y(X n )/n in logarithmic scale 



(left) and the fluctuating part Z^k^o + Xfc) e 



(right). 



We now state the result in the asymmetric case. 

Theorem 5.2 (Asymmetric case: p ^ q). The variance of the size of random asymmetric tries 
satisfies 



V(X n ) G(-l) 



n 



h 



+ &[G](rlog l/p n) + o(l) 



where 



G(-l) 



1 1 



i)V + <? J ) 



2 h ^ 1 - p> - qi 



—y 



4rj7r s 



_ ) Mogp^sinh^ 

1 J^l logf 



2 ) 
V 



if 



log (J 



■rlogp h 
V logq " 



and for k^O (only when gj| e Q) 

G(-i + x *) = x*r(-i + xu) (i - - 1 £ r (* + W**-* + X > 



(-iy(j + 1 + X k)T(j + Xfc) (^ +1 + q j+1 ) 
(j-l)!(j + l)(l-^+ 1 -^' +1 ) 



(35) 



(36) 



26 



These expressions also hold in the symmetric case. However, the expressions for the Fourier 
coefficients in Theorem 5.1 are simpler. 

While the asymptotic pattern of the variance has long been known, the expressions for the 
Fourier coefficients have, as far as we were aware, never been stated before in the above explicit 
forms. 

Consider, for concreteness, the special rational case when q = p 2 . Then p = (V5 — l)/2 is 
the golden ratio. From (35), we see that the non-periodic dominant term for the ratio between 
the variance and n is given by 



which is larger than the symmetric case (31). In general, G(—l) = G(—l;p) is a symmetric 
bath-tub-shaped function of p with its lowest value reached at p = 0.5. The fluctuation of the 
periodic part is bounded above in modulus by 7.3 x 10~ 8 ; see Figure 6. 



Figure 6: Periodic oscillations of the variance whenp= (y/5- l)/2: V(X n )/ n in logarithmic 
scale (left) and the fluctuating part J?[G](x) (right). 

Proof of Theorem 5.1 and Theorem 5.2. We look first at $i(s) = ^\Vt + 4>\] s). By (29) and 
(30), we have 




1.00834 52644 70994..., 





where 




((1 + z) (h(pz) + f x {qz)) - z 2 (pf[(pz) + qf[{qz))) dz. (37) 



To simplify this integral, we use the inverse Mellin integral 




which, by taking derivative with respect to z, 




(38) 
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Substituting these into (37) yields 

w , If (w + + w)(s - w) + l)T(w)T(s - w)(p~ w + q~ w ) , 

ri(s) = / du; 

2tt2 1 - g- w 

= ^ i 1 )'./(./(./ + 8 + 1) - i)r(j + s + + q> +1 ) 

j£ (j+ 1)!(1 

where the last expression is obtained by shifting the line of integration to the left and by col- 
lecting all the residues encountered. 
In particular, when p — 1/2, 

*!(*) = (s+ms) (i - g2 + 28 4 + s 3 + 8 

, .,W + g + 1) - i)r(j + a + (39) 

(i + l)!(2i-l) 

which proves (31) and (32). 

We now consider $ 2 (s) = ~#[0 2 ; s] (see (24)). By (26), 

^ pq_ r T(w + l)(p- w -q- w ) T(s-w + 2) (p— 1 - q^ 1 ) ^ 

2 ^ S) 2^7(0) l-pi-w-qi-w i_ p w-s_ q w-s 

where we shifted the line of integration to the imaginary axis. Since the above function is also 
analytic on 3ff (s) = —1, we can substitute s = — 1 + Xk- The expressions (35) and (36) are then 
obtained by the simplification procedure that we used to derive (27) (for $ 2 (— 1 + Xk)) an d (28) 
with gx(z) = 1 - (1 + z)e~ z and Gx(s) = -(s + l)V(s). I 

An alternative way of simplifying $ 2 (— 1 + Xk) is to shift the line of integration of (40) to 
the left and collect all residues encountered. This then yields the somehow more complicated 
expression 

(j — 1)!(1 — p L+: > — q i+1 )(l — p 1 3 — q l J) 

r(w i + i)r(- Wj + xfe + 1) 



-E 



^1+^ J Q g p _|_ gl+W,' l Q g g 

where Uj runs over all zeros of 1 — p 1+w — q 1+w = with %t{u)j) < 0, and we used the relation 

pq(jf>i - (fj)(p- u i - q-^i) 

1 — — g 1 "^ 

Here the convergence of the second series follows from the exponential decay of Gamma func- 
tion at c ± ioo and the property that the zeros uij are isolated in nature (and equally spaced 
along vertical lines when |^|| 6 Q); see [21] for details. Then we apply the same procedure 
for deriving (27) to further simplify the second series. 
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5.2 External path length 

The cost of constructing tries is directly proportional to the external path length, which is the 
sum of all the distances between each external node (where keys are stored) to the root. For 
example, the external path length of the trie showed in Figure 3 equals 2 + 3+4x3+5x2 = 27. 
Under the same Bernoulli model, the external path length is a random variable, still denoted by 
X n , satisfying (1) with T n = n. This implies that the Poisson generating functions of the first 
two moments of T n are given by 

gi(z) = z(l - e~ z ) and g 2 (z) = z(l + z - e~ z ). 

Thus JS-admissibility of these two functions follows directly from Proposition 3.2. Also 

gi(z) = z + O (\z\~ 5 ) uniformly as \z\ — > oo and |arg(z)| < n/2 — e for all e,5 > 0. 
Moreover, Vt(z) = ze~ z (l — e~ z (l — z + z 2 )) and 

Mz) = e~ z (2zf\(pz) + 2zfx(qz) + 2pz{\ - z)f[(pz) + 2qz{\ - z)f[{qz)) , 

both being again exponential small. 

For the expected value, we have Gi(s) := .M\g\ \ s) = —T(s + 1) and thus 

lim (g 1 ( 8 ) + ——) =7, 

s->-l y S +1 J 

where 7 is Euler's constant. Applying Theorem 4.1, we obtain 

E(-X») 1 7 plog 2 p + qlog 2 q 

= - logn + - + — + ^Gfi](rlog 1/p 7i) + o(l). 

While this result has been widely known and discussed (see, for example, [73, 57, 77]), the 
variance is rarely addressed (see [57, 58, 70]) due partly to its complexity and partly to method- 
ological limitations. 

Theorem 5.3. The variance of the total external path length satisfies 

& feez 

in the symmetric case (when p = 1/2), and 

Y(X n ) pqhg 2 (p/q) d pq hg 2 (p/q) (plog 2 p + qlog 2 q) 
— = P logn+^ + w 

+ ^[G](rlog 1/p n) + o(l), 

in the asymmetric case, where G = $1 + $ 2 with $1, $2 and d given below in (41), (43), and 
(42), respectively. 

The proof follows the same pattern as that used for the size, details being omitted here. In 
particular, we have 

<M» IT* • 1)1 I — S ' 



2 S + 3 

v (-iy(j(s + j) - l)(^ +1 + q^)Y(s + j + 1) (41) 
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where 

d = M . l)+ „^iMA L + i + £jggWgi!j + logp+iogg -, + (42) 



and for 7^ 0, 



*a(-l + X*) = pq ° g2 ^ /q) (Xk ~ l)r(x») + + X*), (43) 



where 



1 7T 2 1 plog 3 p + glog 3 g (plog 2 p + glog 2 g) 2 
/l( - 1) = 4- l0g2+ 67^ + m + W 



2£ 



1 ./ii /"- ; '/'•') 



if logp ^ 
log (J >^ 



and for 7^ 0, 

/l(-l+Xfc) = J'< U) ( U - i - Vfc 22 " + 7 fe ' ' ) + (( 1 - U)(<'( \A " ) - U ) 



x 2 -3 Xfc + 4 \ 2r( X 
2 2 +» h 



v (-1)^(3 + + j - l)T( Xk +jW +l + 

The Fourier series is new. 

In particular, in the symmetric case the Fourier coefficients of the periodic function are 
given by 

- = - + log 2 + 2 > ^ ^ ^ ~ 4.352906698945400 

log2 log2 S ^ J 

and for 7^ 

log 2 4 ^ J'!(2^-1) 

The above numerical value for $i(— 1)/ log 2 is in accordance with that obtained in [70] where 
the authors derived the alternative expression 

1 1 2 ^(-l)i(j + l) 4tt 2 ^ j 



1 , _^ * f_ \^ u -1- _ ^1 \ ^ _ 



2 log 2 log 2 2 log 2^ j(2i-l) log 3 2 ^ sinh fg 
Equating them gives the same identity (34) as we encountered in the size of tries. 
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5.3 Radix sort 



Bucketing is a common design paradigm used for sorting or selecting elements with specified 
properties; see [14]. For sorting purposes, a simple procedure, called radix sort, is to distribute 
elements into b buckets according to their values and then sort within each bucket recursively; 
see [73, 76]. Since we can always normalize elements into the unit interval, splitting into b 
buckets amounts to using o-ary digit expansion of each element and then distribute according 
to the leading digits. Thus the radix sorting process induces a trie with up to b branches at each 
node. 

If we assume that the n elements to be sorted are independent and identically distributed 
uniform random variables (from the unit interval), then the cost X n of radix sort (number of 
digit extractions needed to sort) satisfies (see [76]) 



P(z, y) = (1 - e y)ze- z + e-^^P yY 



where P(z,y) := e~ z '^2 n>0 ^(e XnV )z n /n\. This is nothing but the Poisson generating func- 
tion for the external path length of random bucket tries with branching factor b (using o-ary 
expansion). All analysis above carries through and we have 

f 1 (z) = bf 1 (z/b) + z{l-e-), 

and the corresponding V(z) := f2(z) — fi{z) — zf[(z) satisfies 

V(z) = bV(z/b) + g{z), 

where 

g{z) := e~ z fobzf^z/b) + 2z{l - z)f[(z/b) + z{l - e~ z ) + z 2 e~ z {\ - z] 
Then the Mellin transform of g is given by 

G(s) = F{s + 1) (1 - 2-*- 1 - s2- s - 3 - s 2 2- s - 3 ) 

+3E ( i^!i ) +1 W +*)-i) w.»-2). (44 > 

It follows, by the same Mellin analysis and JS-admissibility, that (xk '■= 2A;7u/log b) 

— ^^g^+rr + J + r^ E nxk)n-*>° + o(i) 

n logo 2 logo 



V(X n ) 



1 J2G(-l + Xk)n-^+o(l). 



n log 6 

An expression for G(s) was derived in [76, p. 755], which is more messy than (44). Indeed, 
one can simplify that expression and obtain 

^( S ) _ i o-s-l „o-s-3 „2r 



1 _ 2- s - L - s2- s -- i - s z 2~ 



T(s + 1) 

- 2{s + l)(s + 2)U(s + 3) + 2(s + l)U(s + 2) + 2V{s + 1), 



31 



where (U(s) = V(s - 1) - V(s)) 

U(s) := ^b-\l + b 



— k\—s 



k^l 



Such an expression for G(s) is on the other hand also easily obtained from (44) by binomial 
theorem. 

In particular, by (44), 

G(-l) = \ + log2 + 2 ((&* + I)"' + log (1 + b- 
From this we obtain the following numerical table. 



b 


G(-l)/log&« 


2 


4.35290 66989 45400 60374 


3 


1.8083911899 9278196720 


4 


1.18266 25542 39848 25415 


5 


0.91013 8137749170 45524 


6 


0.75883 87760 90906 35697 


7 


0.66265 993661111750882 


8 


0.59600 35264 60033 23615 


9 


0.54696 00912 93530 34188 


10 


0.50926 083872624761651 



Thus increasing the number of buckets in radix sort reduces the variance of the cost, with the 
most drastic change from 2 to 3. 



5.4 Peripheral path length 

We define the peripheral path length of a tree as the sum of the fringe-sizes of all leaf-nodes, 
where the fringe-size of a leaf is defined to be the number of external nodes of the subtree 
rooted at its parent-node. This parameter was investigated in [16] where it was called the w- 
parameter. It was also studied in phylogenetics in the context of sum of all minimal clade sizes 
(see [4]). 



If we define T„ 



(T n \I n — k) 



n — 1, if k = l or k = n — 1 
0, otherwise, 



for n ^ 3 and with n—1 replaced by 2 for n = 2, then the peripheral path length X n of random 
tries of n keys satisfies (1) with the initial conditions X = and Xi — 1. 

Since T n depends on I n , such a parameter does not directly fit in our schemes. However, 
the same approach applies. The moment generating function of X n then has the recursive form 

M n (y) = K n>k M k (y)M n _ k (y) + e^^n (pq^ 1 + qp n ~ l ) M n ^(y), 



32 



forn ^ 2 with M (y) = 1 and M x {y) = e y . It follows that 

g t (z) = pqz 2 {e~ pz + e~ qz ) 

g 2 (z) = pqz 2 {2e~ z + (1 + qz)e~ pz + (1 + pz)e~ qz ), 

which are both exponentially small and JS-admissible by Proposition 3.2. The function h 2 (z) 
(see (22)) is now given by 



h 2 (z) = 2e z ^ 22 7r n,k(Hk + /i„-fc)E(T n |J n = k) 

n^2 0<j'<n 



n! 



pqjfa-ffiiqz) + 2e- qz Uvz) + 2e~ pz f[(qz) + 2e^'fi(pz) 
+ 2e~ pz + 2e~ qz " 



which is also JS-admissible by Proposition 3.2. This gives rise to the following expression for 
Mz) (see (24)) 



Mz) = ZPQz [ - e~ pz fi( P z) - e-^Mqz) + e~ pz + e^' 

+ (1 - 2p + pqz)e- qz f[(pz) + (1 - 2q + pqz)e~ pz f[(qz) 
- (2p-p 2 z)e~ pz f[(pz) - (2q-q 2 z)e- qz f'(qz)y 

Finally, 

V T (z) = pqz 2 (2(1 - Apqz + pqz 2 - p 2 q 2 z 3 )e~ z + (1 + qz)e~ pz + (1 + p,z)e~ 92: 
- pgz(4 + 2 — + p 2 z 2 )e~ 2pz - pqz(A + z - Aqz + q 2 z 2 )e~ 2qz 



All these functions are exponentially small for large \z\ with %t(z) > 0. 

Observe that Gi(s) := ^[gi; s) = pq(p~ s ~ 2 + q~ s ~ 2 )T(s + 2). An application of Theorem 
4. 1 then gives 

= 1 + I + HGxW \og 1/p n) + (1), 

where the additional term 1 on the right-hand side arises from the initial condition. 

Although Theorem 4.2 does not apply directly to the variance of X n , the same method of 
proof works well as in Theorem 4.2 part (b)-(i), and we obtain 

^ = ^ + nG]{r log 1/p n) + o(l), 

where a series-form for G(s) can be derived as the discussions above. For simplicity, we have, 
in the symmetric case, 



G{s) =s(s + l)r(s) (V +1 (s + 3) 



s 3 + 5s 2 + 22s + 24 



16 
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In particular, the average value of the periodic function is given by 

13 _ y (-lyjU 2 - 1) = 13 _ 12 y 1 Q 55730 49532 495Q5 

Note that we can also derive the identity 

(-1)^-1) _ 1 3 4vr 2 v fc((2^) 2 + (log2) 2 ) 
j£ 2i-l "log 2 8 (log 2)^ sinhfg 

the series on the right-hand side being smaller than 6 x 1CT 9 . 

5.5 Leader election (or loser selection) 

The coin-flipping process is applicable to single out a leader in real life or in abstract models: 
every individual involved throws a coin and those who get head continue until only one is left; 
see [92] . In this case, the approach we use so far leads to extremely simple forms for the number 
of coin-flippings; this example thus has a more instructional value. Let X n denote the total 
number of coin flipping s used in the leader election procedure of n people. Then X = X\ = 
and the exponential generating function P(z, y) := J2 n >o ^(e XnV )z n /n\ satisfies 

P(z,y) = (e*" 2 + 1) P (^,y) - e^ 2 + (1 - y)z. 

Instead of the usual Poisson generating function, we consider, as in [92], the Bernoulli 
generating function 

e z — 1 ' n\ 

Then A (0) = Oand 

f 1 (z) = f 1 (z/2)+z, 

which gives the identity f\{z) = 2z. Thus E(X n ) = 2n for n ^ 2. Also the normalized 
function V := f 2 — fl — z(f[) 2 satisfies 

V(z) = V(z/2) + z + 4^T- 

e z — 1 

Standard Mellin analysis yields 

v(z) = 2z + -^- + -^- C(2 + x fc )r(2 + x fc K^ + o(|^r 1 ), 

as \z\ —7- oo in the half -plane $t(z) > 0, where £(s) denotes Riemann's zeta function. Conse- 
quently, a similar de-Poissonization argument leads to 

^ = 2n + -^- + -^- J2 C(2 + X,)r(2 + Xfc K^+0(n- 1 ). 
s s fcez\{o} 

Thus, with an average of 2n coin-tossings and a y^-order of standard deviation, selecting a 
leader or a loser by such a naive splitting process is a very efficient procedure. 
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6 Further extensions 



Since BSPs appear in a large number of diverse contexts, many extensions of our frameworks 
are possible. We briefly discuss some examples in this section. 

6.1 Internal path length of random tries 

If, instead of summing over all the distances between the root and each external node (where 
records are stored), we add up all the distance between the root and each internal node, then 
we have the system of recurrences for the number of internal nodes N n (already discussed in 
Section 5.1) and the internal path length X n in a random trie of n elements under the Bernoulli 
model 



for n ^ 2, with initial conditions N = Ni = I = Ii = 0, where N* and X* are independent 
copies of N n and X n , respectively. 

We will see that the variance changes completely its asymptotic behavior and is asymptotic 
to n(logn) 2 weighted by a periodic function. This estimate is independent of the rationality of 
This was previously observed in [88] but with incomplete proof; see also the recent paper 
[44] for a study of Wiener index. 

The asymptotics of the variance can be addressed by the same approach we used for the 
node- wise path length of random digital search trees in [54]. We begin with the moment gen- 
erating function M n (u, v) = E(e NnU+XnV ), which satisfies the recurrence 



We then deduce that the Poisson generating functions of E(iV n ) and E(X n ), denoted by fifi(z) 
and fo,i(z), respectively, satisfy the functional equations 



Let f2,o(z), fi t i(z) and fo^z) denote the Poisson generating function of E(iV^), E(N n X n ) 
and E(JT^), respectively. Then we define the Poissonized versions of the variance and the 
covariance as 




M n (u,v)=e u n n)k M k (u + v,v)M n _ k (u + v,v) (n > 2). 



/i,o(z) = fiflipz) + f Xfi {qz) + 1 - (1 + z)e~ z 
fo,i( z ) = fo,i(P z ) + foAl z ) + h,o(P z ) + h,o(l z )- 



V(z) := f 2)Q (z) 
C(z) := A,i(z) 
W(z) := f 0>2 (z) 



/i,o(z)jo,i(X) - zR, (z)fo tl (z) 



A lengthy calculation then gives 



V{z) = V(pz)+V(qz)+g 2 ,o{z) 

C(z) = C(pz) + C(qz) + V(pz) + V(qz) + g hl (z) 

W(z) = W{pz) + W{qz) + 2C{pz) + 2C{qz) + V(pz) + V{qz) + g , 2 (z), 
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where 



+ l + z-(l + 2z + z 2 + z 3 )e~ z } +pqz (fiM) ~ /5,o(?*) 



2 



and 



g ltl (z) := e- 2 |(l + z) (j 1>0 (pz) + fi,o(qz) + / ,i(pz) + /o,i(gz)) 

- ^ 2 (p£ |0 M + QR,o(v z ) + PfaM) + <i(^)) } 

+ (/i,o(P*) - /i,o(?*0) (/i,o(P«) - /i,oM + /o,iW - /o,i(^)) 

and 

#0,2(2) := P?^ (/5,o(P*) - /i,o(?z) + /o,i(P z ) - /o,i(^)) • 

Then we have 

(a + l)r(a) 



^[A,o;s] 
-^[/o,i;s] 



1 — — 

(s + l)r(s)(j9- s + g- 



(l-p- s -g~ s ) 2 
It follows that (already derived in Section 5.1) 

E(N ) 1 

A_A_ = _ + ^[ Gl)0 ](r tog 1/p n) + 
where C?i,o(s) = — (s + l)r(s). Similarly, 

E(A-»)_/i + ^ |Gw](rlogi/p A>2p 



n 



p log 2 p + g log 2 q 7 — 1 1 1 



+ ^-^T^^ + — -T + r^IGo.xKrlog^n) + o(l), 



/i 3 ft 2 /i /i 

where (-0 being the derivative of log T) 

G„, 1(S ) = r(.) ((«.) + fc - Prtp + itfi j {1 + s) + 1] . 

By the same Mellin analysis, we obtain 

1 — p s — q s 

+ (1 - p - - q->)(G 2 (s) + H 2 (s)) 



JZ[W; s] = (1 _ p _!_ g _ s)3 (V s + q- s )(l + p- s + q- s )(®i(s) + 
+ 2(p~ s + - p~ s - g- s )(G 2 ( S ) + F 2 (s)) 

+ (i_p-_ r -)3£r s ( a )) > 
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where 



and 



G 2 {s) = Jt 
Hs(s) - 



92fl( Z , 

9iM 



-pqz (fio(pz) - K,M z ) 

- PQZ (flo(pz) - f' lfi (qz) + fo A (pz) - fo A (qz) I : .s 



PQZ \J[ fi (jpz) - fl Q (qz) 

P<iz (fi )0 (j>z) - f[ t0 (qz)) (fo tl (pz) - fo }1 {qz) I : .s 
PQZ (f'o,i(pz) - fo^(qz) 



From these functions, we can derive, by the same arguments we used above, asymptotic ap- 
proximations to the covariance of N n and X n , and the variance of X n . 

Theorem 6.1. The variance of the internal path length of random tries satisfies 

V(X n 



n 



F ,2(r log 1/p Ti 



^ + +i©Hog 1/p n) 1 °f + F$(Hog 1/pn) + o(i; 



and f/ie covariance of N n and X n satisfies 



Cov(iV„, X„_) = Fo2(Hogi/pn) lop + Fg(rlogi/pn) + o(1)j 



where Fq j2 (x) = G(—l)/h + ^[G](x) with G given in Section 5.1, and the other F.J 's are 

log q 



either constants when ^ Q or periodic functions with computable Fourier series when 



For simplicity, we give only the expressions in the symmetric case 

J?iO»0 = -j^~ 2 + X k) - G 2 (-l + X k) log2)e 



2knix 



fcez 



4%) 



*)- (1()g2i 



(log 2) 



J2(G[(-1 + Xk) ~ G 2 (-l + Xk) log2)e 



2kirix 



feez 



X —- £(G?(-1 + Xk) ~ 2G' 2 (-1 + Xk) log 2)e 



2kn.r 



where Ca(s) is given in (39) and 

-iyjr(s + j + r 



G 2 (s) = 



(V + 2)(j(j + l + s)-l). 



An intuitive interpretation of why the variance is of order n(logn) 2 is as follows. Any path 
from the root of length k to an internal node contributes 1 + 2 H h k = 0(k 2 ) to the internal 
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path length. Since the expected values of both internal and external path lengths are of order 
n log n, we see that most nodes lie at levels of order log n, and these nodes thus contribute an 
order n(logn) 2 to the variance. 

In a completely similar manner, if Y n denotes the peripheral path length where we change 
subtree-size to the sum of all internal nodes (instead of all external nodes), then we can derive 
the asymptotic approximations to the variance of Y n and the covariance of Y n and N n , which 
are both linear 



v(y„) 



n 



i/p ■ 



Cov(Y n ,N n ) [Y] \,f,\ 
- = Fi,i(rlog 1/p n) + o{l) 



n 



where the 's are either constants when ^ Q or computable periodic functions when 



6.2 Contention resolution in multi-access channel using tree algorithms 

There is an abundant literature on the subject and we are specially interested in the complexity 
of tree algorithms used in resolving the contention before either transmitting information to the 
common shared channel or performing certain tasks in a distributed computing environment. 
The tree algorithm (originally due to Capetanakis, Tsybakov and Mikhailov in the late 1970s) 
resolves the conflict (when more than one user is sending simultaneously his message to the 
common channel) by the outcome of a coin-flipping at each contender's site, similar to the 
splitting rule used for constructing a trie; see [3, 79, 83, 80, 103] for details. The analysis 
of the time needed for such algorithms to resolve the conflict of n contenders often leads to 
recurrences of the form (1) or its extensions. The expected value of the time to resolve all 
conflicts, which corresponds essentially to the size of random tries, has been widely addressed 
in the information-theoretic and communication literature, but there are very few papers on the 
variance; see [64, 65]. 

Consider the extended environment where each "coin" has r distinct outcomes with respec- 
tive probabilities Pi,...,p r , where Yli< m < r Pm — 1 an d none of them is zero. Then the time 
X n to resolve the collision of n contenders satisfies (see [80]) 

P(z,y) = e y J] P( Pm z,y) + (l-ey)(l + z)e- z , (45) 

where P(z,y) := e~ z Y2 n >o ^(e XnV )z n /n\. For simplicity, we consider a version with X = 
X\ = 0; the situation of nonzero initial conditions can be manipulated by extending the same 
arguments we use (only the mean will be altered, the variance remains the same). From (45), 
we obtain the functional equation for the Poisson generating function of E(X n ) 

/i(*)= E f 1 ( Pm z) + l-(l + z)e- z , 

with A(0) = 0, and, similarly, for V := f 2 - ]\ - z(j[)\ 

V(z)= V(p m z) + ~g(z), 

l<m<r 
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with V(0) = 0, where 

g(z) = e - z (l + z-(l + 2z + z 2 + z 3 )e~ z ) 

Z) - Z 2 ^2 Pmfl (Pn 
+ Z PmPl(f'x{PmZ) ~ /l(Pl*)) • 

Then all our analysis extends mutatis mutandis to these functional equations, and we have 
the following asymptotics for E(X n ) and Y(X n ). 
Let 

P{*) ■= E 

Then the entropy is 

h ■= --P'(l) = - E Pm lo gPm- 

As in the Bernoulli case, we need to distinguish between rational (periodic) and irrational 
(aperiodic) cases. The former is characterized either by the existence of a p 6 K such that 

Pn 



p £m , e m G N for 1 ^ m ^ r, or by the ratios 



log P; 



for all pairs (m,l). 



Theorem 6.2. 77ze expected value and the variance of X n (defined in (45) J can asymptotically 
be approximated by 

E{X n ) 1 



n 

YX r , 



+ F 1 (log 1/p n) + o(l), 

+ F 2 (log 1/p n) + o(l) 



where both Fx = F 2 



n h 
in the irrational case and (Xk = f^p) 

= l E X fc r(-1 + Xfe)e 2 
fcez\{o} 

F2 ^ = X ^ G(-l + X fcK 
fcez\{o} 

in the rational case, where G = s] is given in (46) below. 

While the dominant term involving the entropy for the expected value is well-known (see 
[5, 9, 62]), the corresponding term G(—l)/h for the variance is far from being intuitive. On the 
other hand, if we start with X = a and X\ = b, then 



2knix 



-Xk 



E(X„ 
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(b-a) + ((r-l)a+l)(-+F 1 (log 1/p n] 



o(l). 



The function G in the Theorem is described as follows. For 9?(s) > —2, 

s 2 + As + 8' 



G(s) = (s + l)r(s) 1- 



3>1 



2 s+3 

-'K/ (./{./ + S + 1) - l)T(j + S + l)P(j + 1) 
(j + 1)1(1 -P(j + 1)) 



(46) 
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where $ 2 (s) = if p m = 1/r for 1 ^ m ^ r (the symmetric case), and 

* i , v i_ r ( 2 +X*) oy (-l) j r(j + y fc + l)PQ- + l) 

— ^ r(%j + l)r(xfe_j + 1), in the rational case 
0, in the irrational case, 

in the asymmetric case. Consequently, 

Xk + 3' 



G(-i + Xfc) = x*r(-i + Xfc)(i 



2 1+ » 



, ( J -i)!( J + i)(i_P( J + i)) 



i ^ r(xj + l)r(xfe-j + 1), in the rational case; 
0, in the irrational case. 



which reduces to (35) and (36) in the Bernoulli model (r = 2). 
When p m = 1 jr for 1 ^ m ^ r 

/ c 2 _|_ Ac _|_ 

G(s) = (s + i)r(s) i - 1 



2 S + 3 

^ (_i)jjr(j + i + s) , v . 

+ 2 E^jtw^it ( ^ +1+s) - 1); 

compare (31) and (32). Thus the average value of the periodic function is given by 

G(-l) _ 1 2 (-l)^(fc-l) _ 1 2 1_ 

log 6 " 4 log b log 6^ 6 fc - 1 4 log b log 6 ^ (6 fc + 1) 

This is consistent with the expression derived in [64] 

1 1 2^1 4f 2 ^ i 

21og6~ (logfc) 2 + Iol6f^6 fc + l ~ (log 6) 3 ^sinh 2 ^' 

v y fe>l fc^l logo 

Equating the two expressions leads to the identity 

1 1 nST 1 1 ?V 1 4:7V 2 ^ k 

2~M + ^b k + l~4 + f-j + l) 2 + (log 6) 2 ^ sinh 2fc " 2 ' 



which generalizes (34). Our Fourier series for F 2 is new even in this simple case. 

For many other concrete examples, see [3, 55, 79, 80, 83, 103] and the references therein. 
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7 PATRICIA Tries 



In typical random tries, internal nodes at successive levels may have only one descendant (cor- 
responding to the extreme probabilities when binomial distribution assumes and n), resulting 
in an increase in storage. Indeed, the expected number // n of internal nodes under the initial con- 
dition iii — is asymptotic to (h^ 1 + JP{G](r \og l j p n))n (see Section 5.1). Thus the expected 
number of internal nodes with only one child is asymptotic to {h~ x — 1+JP[G] (r log 1 / p n))n. In 
the symmetric case, the leading constant (neglecting the fluctuation term) is about 1 / log 2— 1 « 
.4427, about 44% extra space being needed, and this is the minimum when p varies between 
and 1. The idea of PATRICIA 2 tries arose when there was a need to compress such a one-child- 
in- one-generation pattern; see [73, 84]. When removing all such nodes, the resulting tree has 
n — 1 internal nodes (for n external nodes). See [104] for an analysis connected to unary nodes 
of random tries, and [5, 15, 62, 69, 95] for other linear shape measures. 

Under the same Bernoulli model, we can construct random PATRICIA tries by using the 
same rule for constructing an ordinary trie but compress all internal nodes with only one de- 
scendant. If X n represents an additive shape parameter in a random PATRICIA trie of size n, 
then, for n ^ 2, 



where 



X n L X Ik + X* n _ Iln + T n , (47) 

m = k)=<, k == 1 ^p W g _ gW > (* = l,...,n-l), 

and the X*'s are independent copies of X n . Since we are mainly interested in the variance, 
we may assume that X = X\ = 0. This then translates into the recurrence for the moment- 
generating functions (assuming T n independent of X n ) 

M n (y) = E(e T ^) ^ 7r' n>k M k (y)M n _ k (y) (n > 2), 

lsgfc<n 

with M (y) = Mi(y) = 1. It follows that the Poisson generating function fx of E(X n ) satisfies 
the functional equation 

fx(z) = fxipz) + fx{qz) + gx{z) - e- qz Uvz) - e~ p ^i(gz), (48) 

with /i(0) = /((O) = 0, where gx represents the Poisson generating function of E(T n ). For 
convenience, we also assume gx(0) = g[(0) = 0. 

The same tools we developed for tries readily apply to (48) and the same asymptotic pattern 
holds. 

Theorem 7.1. Let < 6 < tt/2, a < 1 and e M. 
(a) If more precisely gx G J^ a B' t ^ ien 

E ^ = ^^ + ^[G](rio gl/p n) + o(l), 
n n ' 

where Gx(s) = ^[gx(z) — e~ qz gx{pz) — e^ pz gx(qz); s]. 



PATRICIA is the acronym of "practical algorithm to retrieve information coded in alphanumeric". 
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(b) If gi E J?y andgi(z) = cz + 0(\z\ a (log + \z\)^) uniformly for | arg(z)| ^ 9, then 

E ( X n) c d plog 2 p + qlog 2 q 

= - logn + - + — + J^[Gi](rlog 1/p n) + o(l), 

where Gi(s) is the meromorphic continuation of j$\g\{z) — e~ qz c/i(pz) — e~ pz gi(qz); s] 
and d = lim s _ s> _ 1 (G ! i(s) + c/(s + 1)). 

Since the method of proof is the same as that of Theorem 4. 1, we omit the details. 
For the variance of X n , we have, using the same notations, 

V x (z) = V x (pz) + V x (qz) + V T (z) + Mz) + Mz) + fa(z), 

where 

Mz) = -e'^hipz) - e- vz U^) + ^gi(z) (e'^ipz) + e'^qz)) 

- 2zg[(z) (qe- qz g x {jpz) + pe~ pz g l (qz) - pe~ qz g[(pz) - qe~ pz g[(qz)) 

- z (qe'^g^pz) + pe'P'g^qz) - pe- qz g[(pz) - qe~ pz g[(qz)) 2 

- (e-Vfafrz) + e- p %(qz)) 2 , 

and 



>i0) = h 2 (z) - 2g x {z) yfiipz) + fi{qz)J - 2zg[(z) (pf[(pz) + qf[(qz) 

+ 2 {e-Vgxipz) + e^Uqz)) [f x {pz) + ~h{qz)) 
- 2z (qe'^g^pz) + pe-^g^qz) - pe- qz g[(pz) - qe~ pz g[(qz)) 
x (pf[(pz) + qf[(qz) 



h{z) = pqz ( f[{pz) - f[(qz) 



Here h 2 is given by 



-2e- z J2(p n + q n )E(T n )E(X, 



Z" 



Note that, by Propositions 3.2 and 3.3, if g\ E J?Sf \ then f\ E J?Sf ', which in turn implies, 
by Proposition 3.5, that h 2 E JPS? . Consequently, if g x E jPSf 1 and g 2 E JPS? , then both 
fi E and f 2 E JS?. Thus our approach applies to Y(X n ). 

Theorem 7.2. Let < 9 < n/2,a < 1 and {3 E R. Assume g x ,g 2 E J?Sf and V T (z) = 
O (|z| Q (log + \z\)P) for | arg(z) | < 9. 

(a) Ifp = q = 1/2, and ~g x E JS?^ or g x E JS? XQ . Then 



V(X r , 



n log 2 
where G{s) = ^[V T (z) + 4> {z) + fa(z); s}. 



1 J2G(-l + Xk)n-* k +o(l), 
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(b) Assume p ^ q. 

(i) ¥ 9i e S-K,? then 

^ = ^ + ^[G](rlog 1/p n) + o(l), 

where G(s) = $i(s) + $2(5) wzY/z $i(s) = ^[V T (z) + O (^) + 0i(z)] $2(5) 
is an analytic continuation of j$\$>i\ s], 

(zz'J Ifg\{z) = z + 0(\z\ a (log + l-z])' 3 ) uniformly for | &rg(z)\ ^ 6 1 . 77zen 

V(X n ) pq\og 2 (p/q) d plog 2 p + q\og 2 q 
— = P log-+^ + W2 

+ &[G](rlog 1/p n) + o(l). 

Here G(s) = $i(s) + $2(5) with $i(s) a* above, $2(5) z's a meromorphic contin- 
uation of M\<\>2\ s] and d = lim s _ s> _ 1 (G(s) + pq log 2 (p/q)/(h 2 (s + 1))). 

The proof follows the same arguments as that of Theorem 4.2 and is omitted. 

Consider the external path length, which satisfies (47) with T n = n. In this case, we have 

Sfi(z) = z(l - e- z ), g 2 (z) = z{\ - e~ z ) + z 2 , 



and 
Also 



V T (z) = e~ z (z(\ - e~ z ) + z 2 (l - z)e 



Mz) = -2zpq {{zp - \)e**f[(pz) + (zq - l)e~«* f[(qz) 

+ (zp + l)e-* z fi(pz) + (zq + l)e- pz f[(qz) 
+ 2qze- pz f x (pz) + 2pze~ qz h(qz). 

Observe that 

:= J(%{z) - e-« z Uvz) - e- pz ~ 9l (qz); s] = -T(s + 1) [qp~ s ~ l + pq^ 1 ) . 
Thus, by Theorem 7.1, 

EpQ 1 7 plog 2 p + glog 2 g 

— — = -logn+- + — l + ^[G 1 ](rlog 1/p n) + o(l). 

Now by Theorem 7.2, the variance satisfies 

^ = ^ + Sf [G](r bg 1/p n) + o(l), 

where G = $1 + $2, as described in Theorem 7.2. Expressions can be derived for G. For 
brevity, consider only the symmetric case for which we have 

s 2 + 3s + 6 N 
4 

(-iyr( s +j + 2) 



G(s) = $x(s) = T(s + 1) \^2 s+1 (s + 2) - 
+ 2 S+2 ^ 



, (j-l)!(2>-l) 
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Note that the last series has the alternative form 



V (-l)T( 8 +j + 2) _ ys 1 



(j- l)!(2i - 1) v / ^2i(l + 2-^) 

Hence, the mean value of the periodic function is given by 

3 2 



y - 



4 log 2 log 2 ^ 2i(l + 2-J) 2 



0.36132 6059781678 



which is the same as that obtained in [69] with a different expression (equating our expression 
with theirs gives the same identity (34)). 



8 Conclusions 

The prevalent appearance in diverse modeling contexts and high concentration of the binomial 
distribution make BSPs a distinctive subject full of featured properties and numerous exten- 
sions. Periodic oscillation is among the phenomena for which analytic tools proved to be a 
successful bridge between theory and practical observations. The analytic methodology devel- 
oped in this paper, based largely on earlier works founded by Flajolet and his coauthors and 
aiming at clarifying the periodic oscillation of the variance, is itself easily amended for other 
circumstances, including particularly the case of quadratic shape measures such as the Wiener 
index (see [44]) or the analysis of partial-match queries (see [42]). The combination of Mellin 
analysis and analytic de-Poissonization (operated at the more abstract level of admissible func- 
tions) proves once again to be powerful tools for unriddling the intrinsic complexity of the 
asymptotic variance, and provides an efficient mechanical art of conjecturing and proving in 
more general contexts the structure of the variance. More developments will be discussed in a 
subsequent paper. 
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